LOCUS       HDU53215     2384 bp    DNA     linear   BCT 15-MAY-1997
DEFINITION  Haemophilus ducreyi cytolethal distending toxin protein A (cdtA),
            protein B (cdtB) and protein C (cdtC) genes, complete cds.
ACCESSION   U53215
VERSION     U53215.1  GI:2102681
KEYWORDS    .
SOURCE      Haemophilus ducreyi
  ORGANISM  Haemophilus ducreyi
            Bacteria; Proteobacteria; gamma subdivision; Pasteurellaceae;
            Haemophilus.
REFERENCE   1  (bases 1 to 2384)
  AUTHORS   Cope,L.D., Lumbley,S., Latimer,J.L., Klesney-Tait,J., Stevens,M.K.,
            Johnson,L.S., Purven,M., Munson,R.S. Jr., Lagergard,T., Radolf,J.D.
            and Hansen,E.J.
  TITLE     A diffusible cytotoxin of Haemophilus ducreyi
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 94 (8), 4056-4061 (1997)
  MEDLINE   97268696
   PUBMED   9108104
REFERENCE   2  (bases 1 to 2384)
  AUTHORS   Hansen,E.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-APR-1996) Microbiology, UT Southwestern Medical
            Center, 6000 Harry Hines Blvd NA6.200, Dallas, TX 75235-9048, USA
FEATURES             Location/Qualifiers
     source          1..2384
                     /organism="Haemophilus ducreyi"
                     /strain="35000"
                     /db_xref="taxon:730"
     gene            152..823
                     /gene="cdtA"
     CDS             152..823
                     /gene="cdtA"
                     /codon_start=1
                     /transl_table=11
                     /product="cytolethal distending toxin protein A"
                     /protein_id="AAB57725.1"
                     /db_xref="GI:2102682"
                     /translation="MKKFLPSLLLMGSVACSSNQRMNDYSQPESQSDLAPKSSTIQPQ
                     PQPLLSKTPSMSLNLLSSSGPNRQVLPSEPSNFMTLMGQNGALLTVWALAKRNWLWAY
                     PNIYSQDFGNIRNWKMEPGKHREYFRFVNQSLGTCVEAYGNGLIHDICSLDKLAQEFE
                     LLPTDSGAVVIKSVSQGRCVTYNPVSTTFYSTVTLSVCDGATEPSRDQTWYLAPPVLE
                     ATAVN"
     gene            838..1689
                     /gene="cdtB"
     CDS             838..1689
                     /gene="cdtB"
                     /codon_start=1
                     /transl_table=11
                     /product="cytolethal distending toxin protein B"
                     /protein_id="AAB57726.1"
                     /db_xref="GI:2102683"
                     /translation="MQWVKQLSVVFCVMLFSFSSYANLSDFKVATWNLQGSSAVNESK
                     WNINVRQLLSGEQGADILMVQEAGSLPSSAVRTSRVIQHGGTPIEEYTWNLGTRSRPN
                     MVYIYYSRLDVGANRVNLAIVSRRQADEAFIVHSDSSVLQSRPAVGIRIGTDVFFTVH
                     ALATGGSDAVSLIRNIFTTFNSSSSPPERRVYSWMVVGDFNRAPANLEVALRQEPAVS
                     ENTIIIAPTEPTHRSGNILDYAILHDAHLPRREQARERIGASLMLNQLRSQITSDHFP
                     VSFVRDR"
     gene            1700..2260
                     /gene="cdtC"
     CDS             1700..2260
                     /gene="cdtC"
                     /codon_start=1
                     /transl_table=11
                     /product="cytolethal distending toxin protein C"
                     /protein_id="AAB57727.1"
                     /db_xref="GI:2102684"
                     /translation="MKKYLLSFLLIMILALASHAESNPDPTTYPDVELSPPPRISLRS
                     LLTAQPVKNDHYDSHNYLSTHWELIDYKGKEYEKLRDGGTLVQFKVVGAAKCFAFLGK
                     GTTDCKDTDHTVFNLIPTNTGAFLIKDALLGFCITSHDFDDLKLEPCGGSVSGRTFSL
                     AYQWGILPPFGPSKILIPPVRRNQGS"
BASE COUNT      714 a    416 c    494 g    760 t
ORIGIN
1 agaaaagata ttgaacaggt tctaacataa agtataataa aggttcatat gcttttttac
61 gatattatct ggtttggttt tatcaaaaaa aaggataaaa tgcgaagaac ttgtcctttt

121 aaatttaagg atggatctaa ggagagatat aatgaaaaag tttttaccta gtcttttatt 
181 gatgggttca gtggcttgtt catcaaatca acgaatgaat gactattctc aacctgaatc 
241 tcaatctgat ttagcaccta aatcttcaac aatacaaccc caacctcaac ccctattatc 
301 aaaaacacct tcaatgtcac tgaatttgct atcttcatcc ggaccgaata gacaggtatt 
361 gccgtctgaa ccatcaaact ttatgacttt gatgggacaa aatggggcac tgttgactgt 
421 ctgggcgcta gcaaaacgca attggttatg ggcttatccc aatatatatt cgcaggactt 
481 tggaaatatt cgtaattgga agatggaacc cggtaaacac cgtgaatatt ttcgttttgt 
541 taatcaatct ttaggtacat gtgttgaagc ttacggtaat ggtttaattc atgatatttg 
601 tagtctggac aaattagcac aagagtttga gttattacct actgatagtg gtgcggttgt 
661 cattaaaagt gtgtcacaag ggcgttgtgt cacttataat cctgtaagta caacatttta 
721 ttcaacagtt acattatcag tttgtgatgg cgcaacagaa ccatcacgtg atcaaacatg 
781 gtatctcgct ccccctgtat tagaagcaac agcggttaat taaactaagg agtttatatg 
841 caatgggtaa agcagttaag tgtggttttc tgtgtgatgt tatttagctt ttcaagttat 
901 gctaacttga gtgacttcaa agtagcaact tggaatctgc aaggttcttc agcagtaaat 
961 gaaagtaaat ggaatattaa tgtgcgccaa ttattatcgg gagaacaagg tgcagatatt 
1021 ttgatggtac aagaagcggg ttccttacca agttcggcag taagaacctc acgggtaatt 
1081 caacatgggg gaacgccaat tgaggaatat acttggaatt taggtactcg ttcccgccca 
1141 aatatggtct atatttatta ttctcgttta gatgttgggg caaaccgagt gaacttagct 
1201 atcgtgtcac gccgtcaagc cgatgaagct tttatcgtac attctgattc ttctgtgctt 
1261 caatctcgcc ctgcagtagg tatccgcatt ggtactgatg tattttttac agtgcatgct 
1321 ttggccacag gcggttctga tgcggtaagt ctgattcgta atatcttcac tacttttaac
1381 tcatcatcat ccccaccgga aagacgagta tatagctgga tggttgttgg tgatttcaat
1441 cgtgcgccgg ctaatctgga agttgcatta agacaggagc ccgcagtgag tgaaaataca 
1501 attattattg cgccaacaga accgactcat cgatctggta atattttaga ttatgcaatt 
1561 ttacatgatg cacatttacc acgtagagaa caggcccgtg aacgtatcgg tgcaagttta 
1621 atgttaaatc agttacgctc acaaattaca tccgatcatt ttcctgttag ttttgttcgt 
1681 gatcgctaag gaggatatta tgaaaaaata tttattgagc ttcttattaa tcatgatatt 
1741 ggctttggcg agtcatgcag aatcaaatcc tgatccgact acttatcctg atgtagagtt 
1801 atcacctcct ccacgtatta gcttgcgtag tttgcttacg gctcaaccag ttaaaaatga 
1861 tcattatgat tcacataatt acttgagtac acattgggaa ttaattgatt acaaaggaaa 
1921 agaatatgaa aaattacgtg acggtggtac gttagttcaa tttaaagtgg ttggtgcagc 
1981 aaaatgtttt gctttcctgg gcaaaggcac aactgattgt aaagatactg atcatactgt 
2041 gtttaacctt attccaacta atacgggcgc gtttttaatc aaagatgcac tattagggtt 
2101 ttgtataaca agccatgact ttgatgattt gaagcttgaa ccttgtggag gttcagtgag 
2161 tggtcgaacc ttttcgttgg cgtatcaatg ggggatatta cctccttttg gaccaagtaa 
2221 aattttaata ccaccggtgc gaagaaatca gggtagctaa tgttttacat ataattgtat 
2281 ttcttcaaat caagatcctt agtggggcga agaaatataa tgtcattatt gtgcttatgt 
2341 taatgatcat gcaaaaatga gccaggcaga cgcagtaaga tcat



// 
LOCUS       CJU51121     2600 bp    DNA     linear   BCT 28-AUG-1996
DEFINITION  Campylobacter jejuni cytolethal distending toxin (cdtA), (cdtB),
            (cdtC) genes, complete cds, and LctP gene, partial cds.
ACCESSION   U51121
VERSION     U51121.1  GI:1354273
KEYWORDS    .
SOURCE      Campylobacter jejuni
  ORGANISM  Campylobacter jejuni
            Bacteria; Proteobacteria; epsilon subdivision; Campylobacter group;
            Campylobacter.
REFERENCE   1  (bases 1 to 2600)
  AUTHORS   Pickett,C.L., Pesci,E.C., Cottle,D.L., Russell,G., Erdem,A.N. and
            Zeytin,H.
  TITLE     Prevalence of cytolethal distending toxin production in
            Campylobacter jejuni and relatedness of Campylobacter sp. cdtB gene
  JOURNAL   Infect. Immun. 64 (6), 2070-2078 (1996)
  MEDLINE   96239019
   PUBMED   8675309
REFERENCE   2  (bases 1 to 2600)
  AUTHORS   Pickett,C.L.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-MAR-1996) Carol L. Pickett, Microbiology/Immunology,
            University of Kentucky, UKMC 800 Rose St., Lexington, KY 40536, USA
FEATURES             Location/Qualifiers
     source          1..2600
                     /organism="Campylobacter jejuni"
                     /strain="81-176"
                     /db_xref="taxon:197"
     gene            141..947
                     /gene="cdtA"
     CDS             141..947
                     /gene="cdtA"
                     /codon_start=1
                     /transl_table=11
                     /product="CdtA"
                     /protein_id="AAB06707.1"
                     /db_xref="GI:1354274"
                     /translation="MQKIIVFILCCFMTFFLYACSSKFENVNPLGRSFGEFEDTDPLK
                     LGLEPTFPTNQEIPSLISGADLVPITPITPPLTRTSNSANNNAANGINPRFKDEAFND
                     VLIFENRPAVSDFLTILGPSGAALTVWALAQGNWIWGYTLIDSKGFGDARVWQLLLYP
                     NDFAMIKNAKTNTCLNAYGNGIVHYPCDASNHAQMWKLIPMSNTAVQIKNLGNGKCIQ
                     APITNLYGDFHKVFKIFTVECAKKDNFDQQWFLTTPPFTAKPLYRQGEVR"
     gene            944..1741
                     /gene="cdtB"
     CDS             944..1741
                     /gene="cdtB"
                     /codon_start=1
                     /transl_table=11
                     /product="CdtB"
                     /protein_id="AAB06708.1"
                     /db_xref="GI:1354275"
                     /translation="MKKIICLFLSFNLAFANLENFNVGTWNLQGSSAATESKWSVSVR
                     QLVSGANPLDILMIQEAGTLPRTATPTGRHVQQGGTPIDEYEWNLGTLSRPDRVFIYY
                     SRVDVGANRVNLAIVSRMQAEEVIVLPPPTTVSRPIIGIRNGNDAFFNIHALANGGTD
                     VGAIITAVDAHFANMPQVNWMIAGDFNRDPSTITSTVDRELANRIRVVFPTSATQASG
                     GTLDYAITGNSNRQQTYTPPLLAAILMLASLRSHIVSDHFPVNFRKF"
     gene            1752..2321
                     /gene="cdtC"
     CDS             1752..2321
                     /gene="cdtC"
                     /codon_start=1
                     /transl_table=11
                     /product="CdtC"
                     /protein_id="AAB06709.1"
                     /db_xref="GI:1354276"
                     /translation="MKKIITLFFMFITLAFATPTGDLKDFTEMVSIRSLETGIFLSAF
                     RDTSKDPIDQNWNIKEIVLSDELKQKDKLADELPFGYVQFTNPKESDLCLAILEDGTF
                     GAKSCQDDLKDGKLETVFSIMPTTTSAVQIRSLVLESDECIVTFFNPNIPIQKRFGIA
                     PCTLDPIFFAEVNELMIITPPLTAATPLE"
     gene            2502..2600
                     /gene="LctP"
     CDS             2502..>2600
                     /gene="LctP"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="AAB06710.1"
                     /db_xref="GI:1511637"
                     /translation="MLTWQQIYDPFSNIWLSALVAFLPILCFLVCLV"
BASE COUNT      873 a    402 c    421 g    904 t
ORIGIN
1 tgctaaaata taagtgttta agatacatat aaattctacc tttaaaaaca acaaaataaa
61 acatttttaa aaagcggaaa attataatga aatttatgtt attattttct taaaaattta

121 aatacatatc aaggttttta atgcaaaaaa ttatagtttt tattttatgt tgttttatga 
181 ctttttttct ttatgcatgt tcttctaaat ttgaaaatgt aaatcctttg gggcgttcat 
241 ttggagaatt tgaagatact gatcctttaa aactaggact tgaacctact tttcctacca 
301 atcaagaaat tccaagttta attagcggtg ctgatttagt acctattact cctattaccc 
361 cacctttaac tagaacaagc aatagtgcca acaataatgc agcaaatggg atcaatcctc 
421 gctttaaaga cgaagctttt aatgatgttt taatttttga aaatcgccct gcggtttctg 
481 attttttaac cattttaggc cctagcggag cagctttaac ggtttgggct ttagcacaag 
541 gaaattggat ttggggctat actttaatcg atagcaaagg atttggcgat gctagagttt 
601 ggcaactttt gctttatcct aatgattttg caatgattaa aaatgccaaa accaatactt 
661 gtcttaatgc ttatggtaat ggaattgtcc attatccttg tgatgcaagc aatcacgcac 
721 aaatgtggaa acttatccct atgagcaata cagcggttca aattaaaaat ttaggaaatg 
781 gaaaatgcat acaagcacct attacaaatc tttatggtga ttttcacaag gtttttaaaa 
841 tttttaccgt agagtgtgca aaaaaagata attttgatca acaatggttt ttaactactc 
901 caccttttac cgcaaaacct ttatatcgcc aaggagaggt acgatgaaaa aaattatatg 
961 tttattttta tcttttaacc ttgcttttgc aaatttagaa aattttaatg ttggcacttg 
1021 gaatttgcaa ggctcatccg cagccacaga aagcaaatgg agtgttagtg taagacaact 
1081 tgtaagtgga gcaaacccct tagatatctt aatgatacaa gaagcaggaa ctttaccaag 
1141 aacagccact ccaacaggac gccatgtgca acaaggtgga acacctattg atgaatatga 
1201 gtggaattta ggaactcttt caaggcctga tagggttttt atttattatt ctcgcgttga
1261 tgtaggagct aatcgtgtaa atttagctat agtttcaaga atgcaagctg aagaagtgat 
1321 tgttttacct ccacctacta cagtttcaag acccattata ggaattcgca atggaaatga 
1381 tgcttttttc aatatccatg ctttagctaa tggaggaaca gatgtaggag caattatcac 
1441 agctgtagat gcacattttg caaatatgcc tcaagttaac tggatgatag caggggattt 
1501 taaccgtgat ccttctacta taacaagtac agtggataga gaattagcaa atagaattag 
1561 agtggttttt ccaactagcg caactcaagc aagcggaggg actcttgatt atgcaattac 
1621 aggaaattca aatagacaac aaacctatac tccaccgctt ttagctgcga ttttaatgct 
1681 tgcaagttta agatctcata tagtttcaga tcattttcca gtaaatttta gaaaatttta 
1741 ggacatttaa tatgaaaaaa attattactt tgttttttat gtttataact ttagcctttg 
1801 caactcctac tggagatttg aaagatttta ccgaaatggt ttctataaga agcttagaaa 
1861 cgggaatttt tttaagcgcc tttagggata cctcaaaaga tcctattgat caaaattgga 
1921 atattaaaga aattgtttta agcgatgagt taaaacaaaa agataaatta gctgatgaac 
1981 ttccttttgg ttatgtgcaa tttacaaatc caaaagaaag cgatctttgt ttagccatct 
2041 tagaagatgg aacctttgga gcaaaatctt gtcaagatga tctaaaagat ggtaaattag
2101 aaactgtatt ttctataatg ccaacaacaa cttcagctgt gcaaattcgt tctttagttt 
2161 tggaatctga tgaatgtata gtaacttttt ttaatccaaa tattcctata caaaaacgct 
2221 ttggaatagc cccttgcacc ctagatccta ttttttttgc tgaagtaaat gaactaatga 
2281 ttataacccc acctttaaca gctgctaccc ctttagaata agatttttat cttgttctat 
2341 ttttatattt atttaatatt tatgatatta ctaaaataca caaaataatt aataataata 



2401 caatgtaatt taccttgctc tataattttt ttattttaat gtaatttttt gttacaataa
2461 atttatacat aataattatc ttggaggaaa aattggaaca aattttaaca tggcaacaaa
2521 tttatgaccc tttttcaaat atttggctaa gtgctttagt ggcattttta cctatactat
2581 gttttttagt ttgtttggtt
//



LOCUS       ECCDTABC     2305 bp    DNA     linear   BCT 29-JAN-1999
DEFINITION  Escherichia coli E6468/62 (O86:H34) cytolethal distending toxin
            (cdtA, cdtB, cdtC) genes, complete cds.
ACCESSION   U03293
VERSION     U03293.1  GI:416213
KEYWORDS    .
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
            Escherichia.
REFERENCE   1  (bases 1 to 2305)
  AUTHORS   Scott,D.A. and Kaper,J.B.
  TITLE     Cloning and sequencing of the genes encoding Escherichia coli
            cytolethal distending toxin
  JOURNAL   Infect. Immun. 62 (1), 244-251 (1994)
  MEDLINE   94086109
   PUBMED   8262635
REFERENCE   2  (bases 1 to 2305)
  AUTHORS   Scott,D.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-NOV-1993) Scott D.A., University of Maryland School
            of Medicine, Geographic Med., Center for Vaccine Dev., 10 South
            Pine Street, Baltimore, MD 21201, USA
FEATURES             Location/Qualifiers
     source          1..2305
                     /organism="Escherichia coli"
                     /strain="E6468/62 (O86:H34)"
                     /specific_host="Homo sapiens"
                     /db_xref="taxon:562"
                     /clone="pDS7.96"
     gene            201..914
                     /gene="cdtA"
     CDS             201..914
                     /gene="cdtA"
                     /note="GTG start codon"
                     /codon_start=1
                     /evidence=experimental
                     /transl_table=11
                     /product="CdtA"
                     /protein_id="AAD10621.1"
                     /db_xref="GI:416214"
                     /translation="MDKKLIAFLCTLIITGCSNGIGDSPSPPGKNVELVGIPGQGIAV
                     TSNGATPTLGANNTEFPEVSIMSTGGALLTIWARPVRNWLWGYTPFDSVNFGENRNWK
                     VVDGKDAGTVKFVNVAQGTCMEAFKNGVIHNTCDDNSLSQEFQLLPSTNGNVLIRSSA
                     LQTCIRADYLSRTILSPFAFTITLEKCPGAKEETQEMLWAISPPVRAAKPNLIKPELR
                     PFRPLPIPPHDKPDGMEGV"
     gene            911..1732
                     /gene="cdtB"
     CDS             911..1732
                     /gene="cdtB"
                     /codon_start=1
                     /evidence=experimental
                     /transl_table=11
                     /product="CdtB"
                     /protein_id="AAD10622.1"
                     /db_xref="GI:416215"
                     /translation="MKKLLFLLMILPGISFADLSDFKVATWNLQGSNAPTENKWNTHV
                     RQLVTGSGAVDILMVQEAGAVPASATLTEREFSTPGIPMNEYIWNTGTNSRPQELFIY
                     FSRVDAFANRVNLAIVSNRRADEVIVLPPPTVVSRPIIGIRIGNDVFFSTHALANRGV
                     DSGAIVNSVFEFFNRQTDPIRQAANWMIAGDFNRSPATLFSTLEPGIRNHVNIIAPPD
                     PTQASGGVLDYAVVGNSVSFVLPLLRASLLFGLLRGQIASDHFPVGFIPGRGARR"
     gene            1729..2301
                     /gene="cdtC"
     CDS             1729..2301
                     /gene="cdtC"
                     /codon_start=1
                     /evidence=experimental
                     /transl_table=11
                     /product="CdtC"
                     /protein_id="AAD10623.1"
                     /db_xref="GI:416216"
                     /translation="MKTVIVFFVLLLTGCASEPANQRNLLTQFVGNNAPVDPEPSPVL
                     VNIRNVLTGGIIRNPVGSDFNVNNWVISEVKTNDLDLISAPGGHVQIKNPDGNECFAI
                     LNGQLAVAKQCSESDRNALFTFITSDTGAVQIKSIGSGQCLGNGESITDFRLKKCVDD
                     LGRPFDTVPPGLLWMLNPPLSPAIMSPLTS"
BASE COUNT      641 a    415 c    547 g    702 t
ORIGIN
1 tattgaatag ttttgggggg aatataaaga attatatttg agtatgctgt ttgtgactct
61 gggaataata acgagcatta taataaactt gtttgttttt ctggtttcgc atttcctcat
121 taatcttgtt ggtaatattt tcgttgcttt gttgtttcta tttttttata aagaagaggt

181 ggtgcagagg aggaaataca gtggataaaa aactaattgc atttttgtgc acacttataa 
241 ttactggttg ctcgaatggg atcggtgatt caccttcacc tccgggaaaa aatgtagaat 
301 tggttggaat ccctggacaa ggtattgcag tgacttcaaa cggtgcaact ccaacacttg 
361 gagccaacaa cactgagttt cctgaagttt caataatgag cactggtggg gcgctgctta 
421 ctatttgggc cagacctgtt cgtaactggc tttgggggta tactcctttt gattcagtaa 
481 attttggtga gaatcggaac tggaaggttg tggatgggaa agatgccggc acagtgaaat 
541 ttgttaatgt tgcccagggg acttgcatgg aggcctttaa aaacggggtg atacataata 
601 cctgtgatga taactcgtta tctcaggagt ttcagttact gccttctact aatggtaatg 
661 tgcttataag aagtagtgcc ttgcagacgt gtataagagc agactattta agcagaacta 
721 tattgtcacc gtttgctttt acaatcaccc ttgagaaatg ccctggtgca aaagaagaaa 
781 cgcaagaaat gctatgggca ataagtccac ctgtcagagc ggcaaaacca aatctgatta 
841 agccagagtt aagaccattc agaccattgc caattccacc tcatgacaaa cctgatggaa 
901 tggagggagt atgaaaaaat tattattcct gttaatgatt ttgccgggta tttcttttgc 
961 agatttaagc gattttaaag ttgcaacctg gaatttgcag ggttcaaatg caccgacaga 
1021 aaataaatgg aacacacatg tccgacaact tgttacggga agtggtgctg ttgatatcct 
1081 gatggttcag gaggcagggg cagtaccagc ttctgcaacg ttgactgagc gagaatttag 
1141 cactcctggt attccgatga atgagtatat ctggaatacc ggaaccaata gtcgcccaca 
1201 ggagttgttt atatatttct cacgtgttga tgcattcgct aacagagtaa atcttgcgat 
1261 tgtttcaaac agaagagctg atgaggtgat tgtattacct cctccaactg ttgtatcacg 
1321 accgatcatc ggcattagaa ttggtaatga tgttttcttc tcaacccatg cattggcgaa 
1381 tcggggcgtg gattcaggag caattgtaaa tagtgttttt gagttcttca acagacaaac
1441 ggatcctata agacaggccg ctaactggat gattgcagga gattttaacc gttcaccggc 
1501 tacactattt tcaactcttg aaccagggat tcgcaatcat gtaaatatta ttgctccacc 
1561 agatccaacg caagccagtg gtggtgttct tgattatgca gtagttggaa attcagtgag 
1621 ctttgttctt cctctgttga gggcctcgtt gttattcgga ttattaagag ggcaaattgc 
1681 ctctgatcat tttccggttg gctttattcc tggaagagga gcaagaagat gaaaacagtt 
1741 attgtgtttt ttgttttact gctgacaggt tgtgcttctg aacctgcaaa tcagcgtaat 
1801 cttcttactc agtttgtcgg caacaatgcc cctgtagacc ctgaacccag tccagtattg 
1861 gttaatatca gaaacgttct tacagggggg ataatccgaa atcctgttgg cagtgacttt
1921 aatgtaaata attgggttat atctgaagta aagactaatg atttggattt gatatcggca 
1981 ccgggagggc atgttcagat taaaaatcct gatggcaatg aatgctttgc tattctaaac 
2041 gggcaattgg cagtggctaa gcagtgctct gaaagtgacc gtaacgcatt gtttacattt 
2101 ataaccagtg atactggggc tgtgcaaatc aagtcaatag gaagcggtca atgcctaggg 
2161 aatggagaga gcattacaga tttcaggtta aaaaaatgtg ttgatgatct tgggcgtcct 
2221 tttgatacgg tgccgccggg gttactctgg atgctgaatc caccattatc tccggcaata 
2281 atgtctccat taacgagctg atctg 

// 
LOCUS       ECU04208     2600 bp    DNA     linear   BCT 07-JUN-1994
DEFINITION  Escherichia coli 9142-88 cytolethal distending toxin (cdtA, cdtB,
            and cdtC) genes, complete cds.
ACCESSION   U04208
VERSION     U04208.1  GI:436944
KEYWORDS    .
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriaceae;
            Escherichia.
REFERENCE   1  (bases 1 to 2600)
  AUTHORS   Pickett,C.L., Cottle,D.L., Pesci,E.C. and Bikah,G.
  TITLE     Cloning, sequencing, and expression of the Escherichia coli
            cytolethal distending toxin genes
  JOURNAL   Infect. Immun. 62 (3), 1046-1051 (1994)
  MEDLINE   94156453
   PUBMED   8112838
REFERENCE   2  (bases 1 to 2600)
  AUTHORS   Pickett,C.L.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-DEC-1993) C.L. Pickett, University of Kentucky,
            Microbiology and Immunology, Dept. of Micro and Immuno - Chandler
            Medical Center, Lexington, KY 40536, USA
FEATURES             Location/Qualifiers
     source          1..2600
                     /organism="Escherichia coli"
                     /strain="9142-88"
                     /db_xref="taxon:562"
     gene            408..1184
                     /gene="cdtA"
     CDS             408..1184
                     /gene="cdtA"
                     /codon_start=1
                     /transl_table=11
                     /product="CdtA"
                     /protein_id="AAA18785.1"
                     /db_xref="GI:436945"
                     /translation="MANKRTPIFIAGILIPILLNGCSSGKNKAYLDPKVFPPQVEGGP
                     TVPSPDEPGLPLPGPGPALPTNGAIPIPEPGTAPAVSLMNMDGSVLTMWSRGAGSSLW
                     AYYIGDSNSFGELRNWQIMPGTRPNTIQFRNVDVGTCMTSFPGFKGGVQLSTAPCKFG
                     PERFDFQPMATRNGNYQLKSLSTGLCIRANFLGRTPSSPYATTLTMERCPSSGEKNFE
                     FMWSISEPLRPALATIAKPEIRPFPPQPIEPDEHSTGGEQ"
     gene            1181..1990
                     /gene="cdtB"
     CDS             1181..1990
                     /gene="cdtB"
                     /codon_start=1
                     /transl_table=11
                     /product="CdtB"
                     /protein_id="AAA18786.1"
                     /db_xref="GI:436946"
                     /translation="MKKYIISLIVFLSFYAQADLTDFRVATWNLQGASATTESKWNIN
                     VRQLISGENAVDILAVQEAGSPPSTAVDTGTLIPSPGIPVRELIWNLSTNSRPQQVYI
                     YFSAVDALGGRVNLALVSNRRADEVFVLSPVRQGGRPLLGIRIGNDAFFTAHAIAMRN
                     NDAPALVEEVYNFFRDSRDPVHQALNWMILGDFNREPADLEMNLTVPVRRASEIISPA
                     AATQTSQRTLDYAVAGNSVAFRPSPLQAGIVYGARRTQISSDHFPVGVSRR"
     gene            2005..2550
                     /gene="cdtC"
     CDS             2005..2550
                     /gene="cdtC"
                     /codon_start=1
                     /transl_table=11
                     /product="CdtC"
                     /protein_id="AAA18787.1"
                     /db_xref="GI:436947"
                     /translation="MKKLAIVFTMLLIAGCSSSQDSANNQIDELGKENNSLFTFRNIQ
                     SGLMIHNGLHQHGRETIGWEIVPVKTPEEALVTDQSGWIMIRTPNTDQCLGTPDGRNL
                     LKMTCNSTAKKTLFSLIPSTTGAVQIKSVLSGLCFLDSKNSGLSFETGKCIADFKKPF
                     EVVPQSHLWMLNPLNTESPII"
BASE COUNT      834 a    555 c    508 g    703 t
ORIGIN
1 attaacaaat tacaacaaag atcacattaa ataaaaattg acaattgacc tgtagttcat
61 catttgtaaa ttcatgattt atatcaatca cgctttgtgt tcggagtaag cttataaatt

121 acaaaaacga ttaaataaaa aaccacacaa taatattaaa taaaaatacg gtatcgactg 
181 cttttgtttc aaaaggaatt gctattaaaa ctatatactt tcatttagtt ttatcaatta 
241 atgcataact tcaaatgtaa catcaaaaac aatacacctc aaaacaatca cacaaagcaa 
301 caaggacacc caaacaacta aggcactaat aaaaaggaga gtcccaatgt aattctttta 
361 ttcttccatt aatttctact atctttatca taataaggac cataataatg gctaacaaac 
421 gtacacctat ttttatagct ggaatcttga tccccatttt attaaatggt tgctcatcag 
481 gaaaaaataa agcttatctt gaccccaaag ttttccctcc tcaagtggaa ggaggaccaa 
541 ccgttccttc ccccgatgag cccggacttc cattgcccgg gccaggaccg gcgctgccca 
601 caaatggcgc aatccctatc cctgaaccag gtaccgcacc cgcagtatct ttaatgaata 
661 tggatggctc agttctaaca atgtggagcc gcggagctgg ttcatcgtta tgggcgtatt 
721 atatcggcga ctccaattca tttggggaac tacgtaattg gcagattatg cccggaacca 
781 ggccaaatac gatacagttt cgcaatgtag acgttggtac ctgtatgaca agtttcccag 
841 gatttaaagg gggagtacaa ctttctacag caccttgcaa gtttggaccg gaacgtttcg 
901 atttccagcc aatggcaaca cgcaatggta attaccagtt aaaatcttta tctacaggtt 
961 tatgcatcag agcgaatttt ttaggaagaa caccatcatc tccgtacgca acgacattaa 
1021 caatggagcg ttgcccatca agtggagaga aaaactttga attcatgtgg tccataagcg 
1081 aaccattaag gcctgctctg gccactattg ccaagccaga aatacgccca tttccaccac 
1141 agccaataga accagatgag cattcaactg gaggagaaca atgaaaaaat atattatatc 
1201 tctgatagtg tttttatcat tttacgctca agcagattta actgattttc gcgttgcgac 
1261 ctggaatctt caaggtgcat ccgctacgac tgaaagtaaa tggaatataa atgtccggca 
1321 attaatttct ggtgaaaatg ctgtagacat tttagctgta caagaggcag gctctccgcc 
1381 gtcaacggct gtagatacag gtacacttat tccttcccca ggaattcccg tccgagagct 
1441 tatctggaac ttgtcgacaa atagcaggcc acagcaagta tatatatatt tttccgctgt 
1501 tgatgccctc ggtggaagag tcaatcttgc tctggttagc aatcggcggg ccgatgaagt 
1561 gtttgttctt agtcctgtaa gacaaggtgg acgaccattg cttggcatac gaattggtaa 
1621 tgatgcattt ttcactgcac acgccatagc tatgcgaaac aatgatgccc cggctcttgt 
1681 tgaggaagtg tataacttct tccgcgacag cagagaccca gtacaccagg cgcttaactg 
1741 gatgattctt ggtgatttca accgtgaacc tgcggattta gagatgaacc ttactgttcc 
1801 cgtaagaagg gcatcagaaa ttatttcacc agcggcggca acacaaacca gccagcgaac 
1861 attagattat gcagtagcag gaaactctgt ggcatttaga ccatctccgc tacaagcggg 
1921 aattgtatat ggagccagga gaactcaaat atcttcagat catttccctg ttggcgtatc 
1981 cagacgataa aagaggctat cataatgaaa aaattagcaa ttgtttttac tatgctgcta 
2041 atagctggat gctcttcatc acaggattca gctaacaatc agatagatga attaggaaaa 
2101 gaaaacaatt ctctattcac attccgcaat atccaaagtg gcttaatgat ccataatgga 
2161 ttacatcagc atggccgaga gactattgga tgggaaatag tccctgtgaa aacacctgaa 
2221 gaagcacttg ttaccgatca aagcgggtgg ataatgattc gaacgccaaa cacagaccaa 
2281 tgtttaggga cgcctgatgg aaggaacctg ctaaaaatga cgtgtaattc aacagctaag
2341 aaaactttgt tttctctcat accgtcaaca acaggggcag tacaaatcaa aagcgttctg 
2401 tctgggcttt gtttcttaga tagtaaaaat agcggattaa gttttgaaac ggggaaatgc 
2461 attgctgact tcaaaaaacc atttgaagtt gtaccacaga gccatttgtg gatgttgaac 
2521 ccattaaata ctgaatcgcc tattatttaa tcccatcatc gcattttgcc gggcacataa 
2581 aaagcattat cataataagt 

// 
LOCUS       AF373206      468 bp    DNA     linear   BCT 02-MAY-2002
DEFINITION  Escherichia coli strain MBU. E 412 cytolethal distending toxin B
            (cdtB) gene, partial cds.
ACCESSION   AF373206
VERSION     AF373206.1  GI:20385547
KEYWORDS    .
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
            Escherichia.
REFERENCE   1  (bases 1 to 468)
  AUTHORS   Bouzari,S., Oloomi,M. and Zarepoor,M.
  TITLE     Identification of cdtB homologs in diarrheagenic E.coli isolates in
            Iran
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 468)
  AUTHORS   Bouzari,S., Oloomi,M. and Zarepoor,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2001) Molecular Biology, Institute Pasteur of
            Iran, Pasteur Ave., Tehran 13164, Iran
FEATURES             Location/Qualifiers
     source          1..468
                     /organism="Escherichia coli"
                     /strain="MBU. E 412"
                     /db_xref="taxon:562"
     gene            <1..>468
                     /gene="cdtB"
     CDS             <1..>468
                     /gene="cdtB"
                     /note="CdtB"
                     /codon_start=2
                     /transl_table=11
                     /product="cytolethal distending toxin B"
                     /protein_id="AAM21324.1"
                     /db_xref="GI:20385548"
                     /translation="ENKWNTHVRQLVTGSGAVDILMVQEAGAVPASATLTEREFSTPG
                     IPMNEYIWNTGTNSRPQELFIYFSRVDAFANRVNLAIVSNRRADEVIVLPPPTVVSRP
                     IIGIRIGNDVFFSTHALANRGVDSGAIVNSVFEFFNRQTDPIRQAANWMIAGD"
BASE COUNT      130 a     89 c    115 g    134 t
ORIGIN

1 tgaaaataaa tggaacacac atgtccgaca acttgttacg ggaagtggtg ctgttgatat 
61 cctgatggtt caggaggcag gggcagtacc agcttctgca acgttgactg agcgagaatt 
121 tagcactcct ggtattccga tgaatgagta tatctggaat accggaacca atagtcgccc 
181 acaggagttg tttatatatt tctcacgtgt tgatgcattc gctaacagag taaatcttgc 
241 gattgtttca aacagaagag ctgatgaggt gattgtatta cctcctccaa ctgttgtatc 
301 acgaccgatc atcggcatta gaattggtaa tgatgttttc ttctcaaccc atgcattggc 
361 gaatcggggc gtggattcag gagcaattgt aaatagtgtt tttgagttct tcaacagaca 
421 aacggatcct ataagacagg ccgctaactg gatgattgca ggagattt 

// 



LOCUS       G22P1        2743 bp    mRNA    linear   PRI 05-NOV-2002
DEFINITION  Homo sapiens thyroid autoantigen 70kDa (Ku antigen) (G22P1), mRNA.
ACCESSION   NM_001469
VERSION     NM_001469.2  GI:20070134
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2743)
  AUTHORS   Chan,J.Y., Lerman,M.I., Prabhakar,B.S., Isozaki,O., Santisteban,P.,
            Kuppers,R.C., Oates,E.L., Notkins,A.L. and Kohn,L.D.
  TITLE     Cloning and characterization of a cDNA that encodes a 70-kDa novel
            human thyroid autoantigen
  JOURNAL   J. Biol. Chem. 264 (7), 3651-3654 (1989)
  MEDLINE   89139411
   PUBMED   2917966
REFERENCE   2  (bases 1 to 2743)
  AUTHORS   Reeves,W.H. and Sthoeger,Z.M.
  TITLE     Molecular cloning of cDNA encoding the p70 (Ku) lupus autoantigen
  JOURNAL   J. Biol. Chem. 264 (9), 5047-5052 (1989)
  MEDLINE   89174787
   PUBMED   2466842
REFERENCE   3  (bases 1 to 2743)
  AUTHORS   Griffith,A.J., Craft,J., Evans,J., Mimori,T. and Hardin,J.A.
  TITLE     Nucleotide sequence and genomic structure analyses of the p70
            subunit of the human Ku autoantigen: evidence for a family of genes
            encoding Ku (p70)-related polypeptides
  JOURNAL   Mol. Biol. Rep. 16 (2), 91-97 (1992)
  MEDLINE   92301477
   PUBMED   1608402
REFERENCE   4  (bases 1 to 2743)
  AUTHORS   Tuteja,N., Tuteja,R., Ochem,A., Taneja,P., Huang,N.W.,
            Simoncsits,A., Susic,S., Rahman,K., Marusic,L., Chen,J. et al.
  TITLE     Human DNA helicase II: a novel DNA unwinding enzyme identified as
            the Ku autoantigen
  JOURNAL   EMBO J. 13 (20), 4991-5001 (1994)
  MEDLINE   95045391
   PUBMED   7957065
REFERENCE   5  (bases 1 to 2743)
  AUTHORS   Koike,M., Matsuda,Y., Mimori,T., Harada,Y.N., Shiomi,N. and
            Shiomi,T.
  TITLE     Chromosomal localization of the mouse and rat DNA double-strand-
            break repair genes Ku p70 and Ku p80/XRCC5 and their mRNA
            expression in various mouse tissues
  JOURNAL   Genomics 38 (1), 38-44 (1996)
  MEDLINE   97124844
   PUBMED   8954777
REFERENCE   6  (bases 1 to 2743)
  AUTHORS   Baumann,P. and West,S.C.
  TITLE     DNA end-joining catalyzed by human cell-free extracts
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 95 (24), 14066-14070 (1998)
  MEDLINE   99045638
   PUBMED   9826654
REFERENCE   7  (bases 1 to 2743)
  AUTHORS   Goedecke,W., Eijpe,M., Offenberg,H.H., van Aalderen,M. and
            Heyting,C.
  TITLE     Mre11 and Ku70 interact in somatic cells, but are differentially
            expressed in early meiosis
  JOURNAL   Nat. Genet. 23 (2), 194-198 (1999)
  MEDLINE   99438394
   PUBMED   10508516

COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from BC008343.1.
            On Apr 8, 2002 this sequence version replaced gi:4503840.
            Summary: The G22P1 gene encodes subunit p70 of the p70/p80
            autoantigen. The p70/p80 autoantigen consists of 2 proteins of
            molecular mass of approximately 70,000 and 80,000 daltons that
            dimerize to form a 10 S DNA-binding complex. See MIM 194364 for
            discussion of the gene encoding the p80 subunit. Exchange of
            immunologic reagents showed that the p70/p80 autoantigen is
            identical to the Ku antigen, the Ki antigen, and the 86- to 70-kD
            protein complex. The p70/p80 complex binds to the ends of
            double-stranded DNA in a cell cycle-dependent manner, being
            associated with chromosomes of interphase cells, followed by
            complete dissociation from the condensing chromosomes in early
            prophase. Both p70 and p80 contain phosphoserine residues. A role
            for the antigen in DNA repair or transposition has been
            proposed. [supplied by OMIM].
FEATURES Location/Qualifiers 
source 1 • .2743 

/organisms "Homo sapiens" 

/ db_xr e f = " t axon : 9 6 0 6 " 

/chromosome= "22" 

/map="22ql3 .2-ql3 .31" 

/clone="MGC: 15860 IMAGE : 3509801" 

/tissue_type= "Brain, neuroblastoma" 

/ c lone_l ib= " NIH_MGC_1 9 " 

/ 1 ab Jio s t = " DH 1 0 B - R » 

/note= "Vector: pOTB7" 
gene 1..2743 

/gene="G22Pl" 

/not e=" synonyms: ML8, KU70, TLAA, D22S671, D22S731" 
/db_xref= "Locus ID : 2547 " 
/db_xref = "MIM : 152 690 " 
CDS 656.. 2485 

/gene="G22Pl" 

/note=" thyroid autoantigen 70kD (Ku antigen)" 
/codon_start=l 

/product= "thyroid autoantigen 70kDa (Ku antigen)" 
/protein_id="NP_001460 .1" 
/db_xref="GI: 4503841" 
/ db_xr e f = " Locus ID : 2 5 4 7 " 
/db_xref= "MIM: 152690" 

/ translation^ "MSGWESYYKTEGDEEAEEEQEENLEASGDYKYSGRDSLIFLVDA 
SKAMFESQSEDELTPFDMSIQCIQSVYISKIISSDRDLLAWFYGTEKDKNSVNFKNI 
YVLQELDNPGAKRILELDQFKGQQGQKRFQDMMGHGSDYSLSEVLWVCANLFSDVQFK 
MSHKRIMLFTNEDNPHGNDSAKASRARTKAGDLRDTGIFLDLMHLKKPGGFDISLFYR 
D 1 1 S IAEDEDLRVHFEESSKLEDLLRKVRAKETRKRALSRLKLKLNKDIVI SVG I YNL . 
VQKALKPPPIKLYRETNEPVKTKTRTFNTSTGGLLLPSDTKRSQIYGSRQIILEKEET 
EELKRFDDPGLMLMGFKPLVLLKKHHYLRPSLFVYPEESLVIGSSTLFSALLIKCLEK 
EVAALCRYTPRRNIPPYFVALVPQEEELDDQKIQVTPPGFQLVFLPFADDKRKMPFTE 
KIMATPEQVGKMKAIV^KLRFTYRSDSFENPVXQQHFRNLEALALDLMEPEQAVDLTL 
PKV^AMNKRLGSLVDEFKELVTPPDYNPEGKVTKRKHDN 
HIS KGTLGKFTVPMLKE ACRAYGLKSGLKKQELLE ALTKHFQD " 
misc_f eature 764 . . 2314 

/gene="G22Pl" 

/note="ku; Region: A family of DNA binding domains 
including those in Ku70/80" 
/ db_xr e f = " CDD : LOADjcu " 
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misc feature 



misc feature 



misc feature 



     misc_feature    764..1408
                     /gene="G22P1"
                     /note="Ku_N; Region: Ku70/Ku80 N-terminal alpha/beta
                     domain. The Ku heterodimer (composed of Ku70 and Ku80)
                     contributes to genomic integrity through its ability to
                     bind DNA double-strand breaks and facilitate repair by the
                     non-homologous end-joining pathway. This is the amino
                     terminal alpha/beta domain. This domain only makes a small
                     contribution to the dimer interface. The domain comprises
                     a six stranded beta sheet of the Rossman fold"
                     /db_xref="CDD:pfam03731"
     misc_feature    1451..2059
                     /gene="G22P1"
                     /note="ku; Region: Ku70/Ku80 beta-barrel domain. The Ku
                     heterodimer (composed of Ku70 and Ku80) contributes to
                     genomic integrity through its ability to bind DNA
                     double-strand breaks and facilitate repair by the
                     non-homologous end-joining pathway. This is the central
                     DNA-binding beta-barrel domain. This domain is found in
                     both the Ku70 and Ku80 proteins that form a DNA binding
                     heterodimer"
                     /db_xref="CDD:pfam02735"
     misc_feature    1577..2017
                     /gene="G22P1"
                     /note="Ku78; Region: Ku70 and Ku80 are 70kDa and 80kDa
                     subunits of the Lupus Ku autoantigen"
                     /db_xref="CDD:smart00559"
     misc_feature    2066..2332
                     /gene="G22P1"
                     /note="Ku_C; Region: Ku70/Ku80 C-terminal arm. The Ku
                     heterodimer (composed of Ku70 and Ku80) contributes to
                     genomic integrity through its ability to bind DNA
                     double-strand breaks and facilitate repair by the
                     non-homologous end-joining pathway. This is the C terminal
                     arm. This alpha helical region embraces the beta-barrel
                     domain pfam02735 of the opposite subunit"
                     /db_xref="CDD:pfam03730"
     misc_feature    2372..2476
                     /gene="G22P1"
                     /note="SAP; Region: SAP domain. The SAP (after SAF-A/B,
                     Acinus and PIAS) motif is a putative DNA binding domain
                     found in diverse nuclear proteins"
                     /db_xref="CDD:pfam02037"
     misc_feature    2372..2473
                     /gene="G22P1"
                     /note="SAP; Region: Putative DNA-binding (bihelical) motif
                     predicted to be involved in chromosomal organisation"
                     /db_xref="CDD:smart00513"
     variation       1472
                     /gene="G22P1"
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:2229034"
     variation       2256
                     /gene="G22P1"
                     /note="WARNING: map location ambiguous"
                     /allele="A"
                     /allele="T"
                     /db_xref="dbSNP:1803107"
     variation       2402
                     /gene="G22P1"
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:3179615"
     variation       2434
                     /gene="G22P1"
                     /allele="G"
                     /allele="T"
                     /db_xref="dbSNP:132788"
BASE COUNT 732 a 629 c 743 g 639 t 

ORIGIN 

1 gcgggccgtt atccatttgt gttgttcgcc agctaggcct ggcctcgtcc cgcttcgctc 
61 ggtcggtctc gcgcgccccc atagccttgc tagagggtta gcgttagcct taagtgtgcg 
121 aatccgagga gcagcgacag actcgagacc acgctccttc ctcgggaagg aggcggcacc 
181 tcgcgtttga ggcccgcctg cgtttgaggc ccgcctgcgc ttgcggcccg cctgcgcttg 
241 aggcctgtct gcgtttgaga tctcattggg cgtgattgag gaatttgggg aggtttttgg 
301 gcggtattga ggacgagggg gtccgttagt cagcatagaa tcctggagcg ggaatccctc 
361 accgtctaaa tggcgtcggg ggcgggacct ccgggatctg gcttccgcgg gccgccgccg 
421 gccctgaaac gtgagggata gctgagatga ggcagctact gggatggccc ccatgcgcat 
481 ttacatgcag tccgactgcc gagctttcga ggcagcagga tttaccgtcc acattcctca 
541 ctactaacca agcttttaga acagatctca caagaaccta gaggtcggta ttttttcgat 
601 ttaaatttgc ctgttactga cgttaacgtc tttcgcctag tgagcagtag ccaacatgtc 
661 agggtgggag tcatattaca aaaccgaggg cgatgaagaa gcagaggaag aacaagaaga 
721 gaaccttgaa gcaagtggag actataaata ttcaggaaga gatagtttga tttttttggt 
781 tgatgcctcc aaggctatgt ttgaatctca gagtgaagat gagttgacac cttttgacat 
841 gagcatccag tgtatccaaa gtgtgtacat cagtaagatc ataagcagtg atcgagatct 
901 cttggctgtg gtgttctatg gtaccgagaa agacaaaaat tcagtgaatt ttaaaaatat 
961 ttacgtctta caggagctgg ataatccagg tgcaaaacga attctagagc ttgaccagtt 
1021 taaggggcag cagggacaaa aacgtttcca agacatgatg ggccacggat ctgactactc 
1081 actcagtgaa gtgctgtggg tctgtgccaa cctctttagt gatgtccaat tcaagatgag 
1141 tcataagagg atcatgctgt tcaccaatga agacaacccc catggcaatg acagtgccaa 
1201 agccagccgg gccaggacca aagccggtga tctccgagat acaggcatct tccttgactt 
1261 gatgcacctg aagaaacctg ggggctttga catatccttg ttctacagag atatcatcag 
1321 catagcagag gatgaggacc tcagggttca ctttgaggaa tccagcaagc tagaagacct 
1381 gttgcggaag gttcgcgcca aggagaccag gaagcgagca ctcagcaggt taaagctgaa 
1441 gctcaacaaa gatatagtga tctctgtggg catttataat ctggtccaga aggctctcaa 
1501 gcctcctcca ataaagctct atcgggaaac aaatgaacca gtgaaaacca agacccggac 
1561 ctttaataca agtacaggcg gtttgcttct gcctagcgat accaagaggt ctcagatcta 
1621 tgggagtcgt cagattatac tggagaaaga ggaaacagaa gagctaaaac ggtttgatga 
1681 tccaggtttg atgctcatgg gtttcaagcc gttggtactg ctgaagaaac accattacct 
1741 gaggccctcc ctgttcgtgt acccagagga gtcgctggtg attgggagct caaccctgtt 
1801 cagtgctctg ctcatcaagt gtctggagaa ggaggttgca gcattgtgca gatacacacc 
1861 ccgcaggaac atccctcctt attttgtggc tttggtgcca caggaagaag agttggatga 
1921 ccagaaaatt caggtgactc ctccaggctt ccagctggtc tttttaccct ttgctgatga 
1981 taaaaggaag atgcccttta ctgaaaaaat catggcaact ccagagcagg tgggcaagat 
2041 gaaggctatc gttgagaagc ttcgcttcac atacagaagt gacagctttg agaaccccgt 
2101 gctgcagcag cacttcagga acctggaggc cttggccttg gatttgatgg agccggaaca 
2161 agcagtggac ctgacattgc ccaaggttga agcaatgaat aaaagactgg gctccttggt 
2221 ggatgagttt aaggagcttg tttacccacc agattacaat cctgaaggga aagttaccaa 
2281 gagaaaacac gataatgaag gttctggaag caaaaggccc aaggtggagt attcagaaga 
2341 ggagctgaag acccacatca gcaagggtac gctgggcaag ttcactgtgc ccatgctgaa 
2401 agaggcctgc cgggcttacg ggctgaagag tggtctgaag aagcaggagc tgctggaagc 
2461 cctcaccaag cacttccagg actgaccaga ggccgcgcgt ccagctgccc ttccgcagtg 
2521 tggccaggct gcctggcctt gtcctcagcc agttaaaatg tgtttctcct gagctaggaa 
2581 gagtctaccc gacataagtc gagggacttt atgtttttga ggctttctgt tgccatggtg 
2641 atggtgtagc cctcccactt tgctgttctt tactttactg cctgaataaa gagccctaag 
2701 tttgtactaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 
LOCUS       AF093759     2493 bp    DNA     linear   PRI 14-AUG-2001
DEFINITION  Homo sapiens heat shock protein HSP70 (HSPA7) gene, complete
            sequence.
ACCESSION   AF093759
VERSION     AF093759.1  GI:4139180
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 2493)
  AUTHORS   Parsian,A.J., Sheren,J.E., Tao,T.Y., Goswami,P.C., Malyapa,R., Van
            Rheeden,R., Watson,M.S. and Hunt,C.R.
  TITLE     The human Hsp70B gene at the HSPA7 locus of chromosome 1 is
            transcribed but non-functional
  JOURNAL   Biochim. Biophys. Acta 1494 (1-2), 201-205 (2000)
  MEDLINE   20525459
  PUBMED    11072087
REFERENCE   2 (bases 1 to 2493)
  AUTHORS   Hunt,C.R., Malyapa,R., Parsian,A.J., Goswami,P.C., Van Rheeden,R.
            and Watson,M.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-SEP-1998) Radiation Oncology, Washington University
            School of Medicine, 4511 Forest Park Blvd., St. Louis, MO 63108,
            USA
FEATURES             Location/Qualifiers
     source          1..2493
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="1"
                     /map="1q21.3"
     gene            233..2149
                     /gene="HSPA7"
                     /note="heat-inducible heat shock protein HSP70; possible
                     pseudogene"
BASE COUNT      554 a    676 c    794 g    469 t
ORIGIN
1 cccgggcggg cgggcgggag gctctcgact gggcgggaag gtgcgggaag gttcgcggcg 
61 gcggggtcgg ggaggtgcaa aaggatgaaa agcccgtgga agcggagctg agcagatccg 
121 agccgggctg gcggcagaga aaccgcaggg agagcctcac tgctgagcgc ccctcgacgg 
181 cggagcggca gcagcctccg tggcctccag catccgacaa gaagcttcag ccatgcaggc 
241 cccacgggag ctcgcggtgg gcatcgacct gggcaccacc tactcgtgcg tgggcgtgtt 
301 tcagcagggc cgcgtggaga tcctggccaa cgaccagggc aaccgcacca cgcccagcta 
361 cgtggccttc accgacaccg agcggctggt cggggacgcg gccaagagcc aggcggccct 
421 gaacccccac aacaccgtgt tcgatgccaa gcggctgatc gggcgcaagt tcgcggacac 
481 cacggtgcag tcggacatga agcactggcc cttccaggtg gtgagcgagg gcggcaagcc 
541 caaggtgcgc gtatgctacc gcggggagga caagacgttc taccccgagg agatctcgtc 
601 catggtgctg agcaagatga aggagacggc cgaggcgtac ctgggccagc ccgtgaagca 
661 cgcagtgatc accgtgccca cctatttcag taactcgcag cgccaggcca ccaaggacgc 
721 gggggccatc gcggggctca aggtgctgcc gatcatcaat gaggccacgg cagcagccat 
781 cgcctatggg ctggaccggc ggggcgcggg aaagcgcaac gtgctcattt ttgacctggg 
841 tgggggcacc ttcgatgtgt cggttctctc cattgacgcc ggtgtctttg aggtgaaagc 
901 cactgctgga gatacccacc tgggaggaga ggacttcgac aaccggctcg tgaaccactt 
961 catggaagaa ttccggcgga agcatgggaa ggacctgagc gggaacaagc gtgccctgcg 
1021 caggctgcgc acagcctgtg agcgcgccaa gcgcaccccg tcctccagca cccaggccac 
1081 cctggagata gactccctgt tcgagggcgt ggacttctac aagtccatca ctcgtgcccg 
1141 ctttgaggaa ctgtgctcag acctcttccg cagcaccctg gagccggtgg agaaggccct 
1201 gcgggatgcc aagctggaca aggcccagat tcatgacttc gtcctggggg gagggctcca 
1261 ctcgcatccc caaggtgcag aagttgctgc aggacttctt caacggcaag gagctgaaca 
1321 agagcatcaa ccctgatgag gctgtggcct atgggtctgc tgtgcaggcg gccgtgttga 
1381 tgggggacaa atgtgagaaa gtgcaggatc tcctgctgct ggatgtggct cccctgtctc 
1441 tggggctgga gacagcaggt ggggtgatga ccacgctgat ccagaggaac gccactatcc 
1501 ccaccaagca gacccagact ttcaccacct actcggacaa ccagcctggg gtcttcatcc 
1561 aggtgtatga ggttgagagg gccatgacca aggacaacaa cctgctgggg cgttttgaac 
1621 tcattggcat ccctcctgcc ccacatggag tcccccagat agaggtgacg tttgacattg 
1681 atgctaatgg catcctgagc gtgacagcca ctgacaggag cacaggtaag gctaacaaga 
1741 tcaccaatga caagggccgg ctgagcaagg aggaggtgga gaggatggtt catgaagccg 
1801 agcagtacgg ggctgaggat gaggcccaga gggacagagt ggctgccaaa aactcgctgg 
1861 aggcccatgt cttccatgtg aaaggttctt tgcaagagga aagccttagg gacaagattc 
1921 ccgaagagga caggcgcaaa gtgcaagaca agtgtcagga agtccttgcc tggctggagc 
1981 acaaccagct ggcagagaag gaggagtatg agcatcagaa gagggagctg gagcaaatct 
2041 gtcgccccat cttctccagg ctctatgggg ggcctggtgt ccctgggggc agcagttgta 
2101 gcgctcaagc ccaccagggg gaccccagca ccggccccat cattgaggag gttgattgaa 
2161 tggcccttcg tgataagtca gctgtgactg tcagggctat gctatgggcc ttctagactg 
2221 tcttctatga tcctgccctt cagagatgaa gggcttgggg gggtcttccc tccaaagcta 
2281 gaactttctt tccaggataa ctgaagtctt ttgacttttt ggggggaggg cggttcatcc 
2341 tcttctgctt caaataaaaa gtcattaatt tattaaaact tgtgtggcac tttaacattg 
2401 ctttcaccta tattttgtgt attttgttac ttgtatgtat gaattttgtt atgtaaaata 
2461 tagttataga cctaaataaa cttttaaaac tcc 